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Method and Equipment for Controlling the Congestion Management and 
Scheduling of Transmission-Link Capacity in Packet-Switched 
Telecommunications 

5 The present invention relates to a method, according to Claim 1, for controlling the 

congestion management and scheduling of transmission-link capacity in packet-switched 
telecommunications. 

The invention also relates to equipment, according to Claim 8, for controlling the 
10 congestion management and scheduling of transmission-link capacity in packet-switched 
telecommunications. 

In this publication, the following abbreviations are used in the descriptions of both the 
prior art and the invention: 

15 

Service level class for applications, which are able to exploit the 
momentarily available capacity of a data transmission network, but for 
which the capacity of the data transmission network is not reserved (Best 
Effort), 

Service level class (Class of Service), 

Data carried by a packet, stating the service level class to which the 
packet in question belongs (Differentiated Services Code Point), 
First In First Our discipline, 

A service level class for applications, which are able to exploit the 
momentarily available capacity of a data transmission network, and for 
which a specific data transmission capacity is reserved (Guaranteed rate 
and Best Effort), 

A service level class that is similar to aG+E, but, in service level class 
bG+E, an overbooking ratio of a different magnitude to that in service 
level class aG+E can be used, if desired, 

A variable expressing an internal sub-group (e.g., drop preference) of a 
service level class. 
Overbooking Ratio, 
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QoS Quality of service, 

q Variable expressing the service level class, 

SFQ Start-time Fair Queuing, one scheduling method [1] based on a v^eighting 

coefficient, 

5 SLA Service Level Agreement, 

wfq a general term (v^eighted fair queuing) applied to a scheduling method 

based on a weighting coefficient, 
WFQ Weighted Fair Queuing, one scheduling method [1] based on a weighting 

coefficient, 

10 WRED Weighted Random Early Detection, a congestion limitation method [3, 4] 

based on a weighting coefficient. 

In a packet-switched telecommunications system, it is often preferable for the packets 
being transmitted to be classified as belonging to different service level classes (CoS), 

15 according to the requirements of the applications used by the telecommunications 
service, and, on the other hand, according to the kind of agreements on the service 
quality (SLA) the telecommunications service provider has made with its customers (end 
users). For example, in the case of normal telephone applications, it is essential for the 
data transmission speed required by the application to be available for the Ume required, 

20 for the transmission delay to be sufficiendy small, and the variation in the transmission 
delay to be sufficient low. In telephone applications, there is no advantage in being able 
to momentarily increase the data transmission speed provided for an application, if the 
loading on the telecommunications network is small at the time in question. On the other 
hand, when downloading web-pages, it is extremely advantageous to be able to exploit 

25 the full, even momentarily available, transmission capacity of the network. 



It is often advantageous to use overbooking for some service level classes. An 
application representing a specific service level class, for which a specific transmission 
speed [bit/s] is ordered by the service level agreement (SLA), will be examined. The 
30 telecommunications network is required to provide the transmission speed ordered for 
the application in question, with a probability of 99.99 %. In order to meet this demand, 
the data transmission capacity [bit/s] is reserved in the data transmission links and other 
network elements for applications using the service level class in question. When using 
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overbooking, the data transmission capacity reserved in a specific link or other network 
element is lower than the total sum of the transmission speeds ordered in the service 
level agreements (SLA) in the case of the relevant part of the network. Overbooking 
naturally increases the probability of breaching the service level agreement (SLA). 
However, in practice it is improbable that even nearly all of the end users using the 
specific service level class will attempt to simultaneously utilize the transmission speed 
defined in their service level agreement. From the point of view of the service provider, 
overbooking is profitable, as long as the payments from end users received with the aid 
of overbooking (thus selling more transmission capacity) are greater than the costs 
incurred by the increase in breaches of the service level agreements. The overbooking 
ratio (OBR) expresses the ratio of the total sum of the transmission speeds ordered for 
specific traffic to the data transmission capacity reserved for the traffic in question. The 
overbooking ratio can be network element specific. 

If overbooking is used in some service level class, it should be arranged so that the 
overbooking used in the specific service level class does not reduce the quality of service 
in other service level classes. Service quality reduction can appear, for example, in the 
form of increased packet loss, of increased transmission delays and delay variations, or 
in a reduced ability of the application to utilize the available transmission capacity of the 
network at any time. The risk of service quality reduction caused by overbooking should 
affect only the service level class, in which the overbooking is used. In this publication, 
overbooking meeting such conditions is termed controlled overbooking. 

The following examines a situation, in which the telecommunications services provides 
the following types of service level class: 

-aG + E (Guaranteed rate and Best Effort) for an application, for which a service 
level agreement (SLA) is used to order a specific (minimum) transmission speed 
[bit/s] and for which the momentary data transmission speed provided is 
increased exploiting the data transmission system capacity available at each time. 
A data transmission capacity [bit/s] is reserved in the network elements for 
applications representing the aO + E service level class. 
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-bG + E: a service level class corresponding to aG + E, but in the service class 
bG + E it is possible, if desired, to use an overbooking ratio (OBRbc+E) of a 
different magnitude to that in the service level class aG + E (OBRac+E). 

5 -BE (Best Effort): for applications, for which a data transmission capacity is 

neither reserved in the network elements, nor, on the other hand, is a (minimum) 
transmission speed ordered using a service level agreement (SLA), but for which 
the telecommunications system's capacity available at any time is exploited. 

10 Figure 1 shows one way according to the prior art of scheduling the capacity of a 

common transmission link for traffic flows representing the aforementioned service level 
classes (aG+E, bG+E, or BE), The operation of the system shown in Figure 1 is as 
follows: 

15 -The service level class q, to which an individual packet belongs, can be 

identified on the basis of information attached to the packet (for example, DSCP 
= Differentiated Services Code Point [2]). 

-Packets are routed into service-level-specific FIFO queues 3-5 (aG+E, bG+E, 
20 and BE queues). 



-Each packet representing an aG+E or bG+E service level class belongs to an 
internal sub-group (p) in the service level class, on the basis of which it is 
possible to decide at least whether the packet in question belongs to the portion 

25 of the traffic that corresponds to the minimum transmission speed order in the 

service level agreement (SLA), (this will be subsequently referred to as the G 
portion), or whether the packet belongs to the portion of the traffic that exceeds 
the ordered minimum transmission speed (this will be subsequently referred to as 
the E portion). Membership of a specific sub-group p can be indicated, for 

30 example, with the aid of drop precedence information carried by the DSCP [2]. 

Hie sub-group information is used when congestion requires a decision to be 
made as to the packets in the queue or arriving in the queue on which congestion 
limitation measures should be imposed. An example of this is the WRED method 
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(Weighted Random Early Detection) [3, 4]. 

-The capacity of the transmission link is scheduled between the aG+E, bG+E, 
and BE queues 5, using a weighting coefficient based scheduling method (for 
5 example SFQ [1]). In a congestion situation, the capacity of the transmission link 

is divided between the aG+E, bG+E, and BE service level classes, in a ratio 
determined by the corresponding weighting coefficients (Wac+E : Wbo+E : Wbe). 

In the scheduling method shown in Figure 1, the weighting coefficients Wac+E, Wbc+E, 
10 and Wbe are chosen while bearing in mind that the traffic representing the service level 
classes aO-E, and bG+E must receive the portions of the capacity of the transmission 
link reserved for them. A problem in the system shown in Figure 1 is that, in addition to 
meeting the aforementioned requirement, it is not possible to define the weighting 
coefficient by which the E portions of the traffic representing the service level classes 
15 aG+E and b+E and the BE traffic will compete for the portion of the capacity of the 
transmission link that is not either reserved for the use of traffic representing some 
service level class, or is reserved, but is not being used at the moment in question by 
traffic entitled to the reservation. 

20 Figure 2 shows a method according to the prior art disclosed in reference [5] 

(confidential at the time of writing the present application), in which the value of the 
scheduling weight depends on both the quality class (q) and the sub-group (p). It is then 
possible to separately define 1) what relative portion of the capacity of the transmission 
link will be given to the portion of traffic of each service level class that corresponds to 

25 the ordered minimum transmission speed (the G portions of aG+E and bG+E) and 2) 

with what weighting coefficient the traffic portion, which exceeds the ordered minimum 
transmission speed (the E portions of aG+E and bG+E and BE), will compete for that 
portion of the capacity of the transmission link, which is not at the moment under 
examination being used by a traffic portion (the G portions of aG+E and bG+E) 

30 representing the minimum transmission speed ordered for some service level class. 

In the system according to Figure 2, the traffic portions (the G portions of aG+E and 
bG+E) entitled to the reservations should be given, on the one hand, sufficiently large 
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scheduling weights relative to the scheduling weights of the traffic portions (BE and the 
E portions of aG+E and bG+E) unentitled to the reservations, so that it is possible lo 
ensure that the traffic portions entitled to the reservations receive the use of the 
transmission capacity portions reserved for them, even in a congestion situation. On the 

5 other hand, however, the scheduling weights in questions should be small enough so that 
the overbooking to be used in traffic portions entitled to the reservations will reduce the 
performance of only the service level class in which overbooking is used. A problem in 
the system according to Figui« 2 is that it is only in exceptional cases that the said 
requirements affecting the scheduling weights (ensuring reservations, division of the 

10 available transmission capacity into the desired ratios, and controlled overbooking) will 
not be mutually contradictory. 

An additional problem in the methods shown in Figures 1 and 2 is that, in a situation in 
which, for example, the queue 3 of the aO+E quality class has become congested in the 

15 quality class in question due to overbooking being used, the congestion limitation 

mechanism (e.g., WRED [3, 4]) will not be able to limit the length of the queue in a 
manner corresponding to that in a situation, in which the congestion is due to traffic 
representing the E portion being offered. This is because the congestion limitation 
mechanism uses sub-group information (e.g., drop precedence) to decide which packets 

20 to apply the congestion limitation measures to. when the queue length and/or a variable 
derived from it exceeds a specific threshold value. If the sub-group information states 
that the packet belongs to the G portion, a higher threshold value is used, which the 
queue length or its derivative must exceed before a congestion limitation measure is 
applied to the packet in question, than in a situation in which the packet being examined 

25 belongs to the E portion. When using overbooking, the queue can become already 

congested due to the effect of only the G portion. Any increase in the length of the queue 
will increase the transmission delay and hampers the operation of, for instance, TCP 
protocol flow control and monitoring mechanisms [6]. 

30 The present invention is intended to eUminate the defects of the state of the art described 
above and for this purpose create an entirely new type of method and equipment for 
scheduUng transmission link capacity between packet-switched traffic flows. The object 
of the invention is a method, by means of which a schedule and congestion 
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management mechanism can be implemented, in such a way that the following 
properties are achieved: 

1) A specific portion of the capacity of the transmission link can be reserved for 
traffic representing a specific service level class, and 

2) it is possible to define the weighting coefficient by which each portion of the 
traffic of the service level class, which exceeds the capacity of the portion of the 
transmission link reserved for the service level class in question, will compete for 
the portion of the capacity of the transmission link, which is either not reserved 
for the use of traffic representing some service level class, or which is reserved 
but is not being used at the moment in question by traffic entitled to the 
reservation, and 

3) it is possible to use overbooking in such a way that the reduction in the quality of 
the service caused by overbooking only affects the service level class, in which 
overbooking is used (controlled overbooking), and 

4) an increase in the queue length that is detrimental in terms of the traffic-flow 
control (e.g., using the TCP protocol [6]) can be prevented even in a congestion 
situation arising from overbooking. 

The invention is based on measuring the traffic flow coming to be scheduled, in which 
the traffic flow mentioned is formed of packets arriving in a queue representing a 
specific service level class, or some of the packets in question, and the operation of the 
scheduler (e.g., SFQ [1]) and the congestion-limitation mechanism (e.g., WRED [3, 4]) 
is controlled on the basis of the measurement. 

The use of the method according to the invention purely to control the scheduler 
mechanism does riot prevent the use of a traditional congestion limitation method based 
on sub-group information (e.g., drop precedence). Using the method according to the 
invention purely to control a congestion limitation mechanism does not prevent the use 
of scheduling methods according to the prior art. 
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The measurement result can be a single number, the value of which expresses 
information to be utilized in control, or many number (vector), the values of which 
express information to be utilized. In the following, the measurement result will be 
5 treated as a vector formed of several sub-results, as it is the most general approach. 

The method according to the invention is characterized by what is stated in the 
characterizing portion of Claim 1. 

10 The equipment according to the invention is, in turn, characterized by what is stated in 
the characterizing portion of Claim 8. 

The use of the invention achieves the advantage over solutions according to the prior art 
that it is possible to implement the scheduler and congestion-limitation mechanism in 
15 such a way that the reduction in quality arising from overbooking only affects the 

service level class in which overbooking is used and, in addition, can prevent an increase 
in the length of the queue that is detrimental to traffic-flow control, even in a congestion 
situation arising from overbooking. 

20 In the following, the invention is examined in greater detail with the aid of examples 
according to the accompanying figures. 

Figure 1 shows a block diagram of one system according to the prior art, for scheduling 
the capacity of a conmion transmission link for traffic flows representing the 
25 aforementioned service level classes (aG+E, bG+E, BE). 

Figure 2 shows a block diagram of a second system according to the prior art, for 
scheduUng the capacity of a common transmission link for traffic flows representing the 
aforementioned service level classes. 



30 



Figure 3 shows a block diagram of a system according to the invention, for scheduling 
the capacity of a common transmission link for traffic flows representing the 
aforementioned service level classes. 
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The theoretical basis of the method according to the invention will become apparent 
from the following examination. 

5 In the weighting-coefficient-based scheduling system, a sequence indication (for 

example, Start_tag SFQ in method [1]) is arranged for the packet in the input to the 
scheduler 1, to state when the packet in question will be in turn for forwarding. The first 
packet to be forwarded is that with a sequence indication value stating the earliest 
forwarding moment. The sequence indication need not be bound to real time, it is 
10 sufficient if the sequence indications of the packets are in a sensible relation to each 
other. 

When forming the sequence indication, a weighting coefficient corresponding to the 
service level class in question is used for packets coming from a specific service level 
15 queue. If queue J 1 has a greater weighting coefficient than queue J2, then the series of 
sequence indications of the consecutive packets of queue Jl, relative to those of the 
corresponding ones of queue J2 is formed to be such that the queue Jl receives a larger 
portion of the capacity of the output of the scheduler 1. 

20 In the priority-based scheduling system, the packets in the input of the scheduler are 

given a priority value. The priority values of the packets determine which packet is the 
next to be forwarded. 



In the method according to the invention, the priority value given to the packet, or the 
25 weighting coefficient used in forming the sequence indication does not depend only on 
the service level class represented by the packet (which is this publication is referred to 
as the variable q), but also on the result (which in this publication is referred to as the 
variable vector x) provided by the measurement 3 made from the traffic flow of the 
service level class in question on from the portion of the traffic flow in question. Figure 
30 3. 



In the method according to the invention, the measurement datum/data can also 
determine whether the weighting coefficient or priority-based scheduling method is used 
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to make the scheduling decision for a specific packet. 

In congestion management, the length of the queue or a variable derived from it, such as 
a low-pass filtered value, is utilized. If the length of the queue and/or its derivative 

5 exceeds a specific threshold value, congestion limitation measures are applied to specific 
packets in the queue or arriving in it. The congestion limitation measures can be packet 
dropping (discarding) or marking (ECN method [2]). The selection of packets within a 
specific service level class, to which the congestion limitation measures are applied, is 
based on sub-group information (e.g., drop precedence) in a congestion management 

10 method representing the prior art. The principle is that, for example, in the case of class 
aG+E, the congestion limitation measures are applied to the packets representing the 
service level class in question, which, on the basis of the sub-group information, belong 
to the E portion. If the increase in the length of the queue does not stop by dropping (or 
marking) packets representing the E portion, packets representing the G portion are also 

15 begun to be dropped (marked). In the WRED method, this is implemented in such a way 
that the threshold value of the queue length, or the variable derived from it defined for 
the G portion is greater than the corresponding threshold value defined for the E portion. 
Unless overbooking is being used, the dropping (or marking) of purely packets 
representing the E portion should already prevent congestion, as the necessary 

20 transmission capacity has been reserved for the traffic representing the G portion. If 
overbooking is used, the queue can continue to increase even in a situation in which 
congestion limitation measures are already being applied to all packets representing the 
E portion. This is due to the fact that, when using overbooking (as defined), it is possible 
for a greater amount of traffic representing the G portion, than the transmission capacity 

25 reserved for the G portion to attempt to reach the transnMSsion link. In that case, the 
length of the queue is limited on the basis of the threshold value defined for the G 
portion. However, in terms of the delay behaviour and flow management (e.g., TCP), it 
is preferable for the length of the queue to remain as short as possible. This is attempted, 
for example, using the WRED algorithm in such a way that the threshold value, after 

30 which packets representing the E portion are begun to be dropped (or marked) is low. On 
the other hand, a low threshold value cannot be used for the G portion, in order to 
achieve a clear dropping/marking hierarchy - limitation measures are applied first of all 
to the E portion and only after that to the G portion. Thus, in a congestion situation 
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caused by overbooking, for example, the basic objective of the WRED algorithm of 
keeping the queue short is not met. 

In the method according to the invention, the problem described above relating to the 
5 length of the queue is solved by using measurement results x, Figure 3, instead of, or 
along with the sub-group information in congestion management. 

The following illustrates the operation of the scheduling and congestion management 
method according to one embodiment of the invention in the case of traffic flows 
10 belonging to the classes aG+E and bG+E, using the SFQ scheduling algorithm [1] and 
the WRED congestion management algorithm [3, 4]. In this embodiment of the 
invention, the packet-specific weighting coefficient is defined on the basis of the 
measurement results as follows: 

15 For the portion of a traffic flow representing the service level class aG+E, for which, in 
the case of the packet being examined, the measured number of bits transmitted is, 
during an arbitrary examination period T from the past to the present less than CIR x T + 
CBS, the packet-specific weighting coefficient Wac+E = Wga, for the excess portion 
WaG+E = WEa- Correspondingly, in the service level class b+E, the weighting coefficient 

20 Wbo+E = Wob or Wfib- CIR is the available transmission band (committed information 
rate [bit/s]) reserved from the G portion of the service level class, which, when using 
overbooking is less than the largest possible amount [bit/s] of traffic representing the G 
poition. CBS is the largest permitted burst size [bit] (committed burst size). The 
measurement described here can be implemented using, for example, the Token Bucket 

25 method [7]. 

The portion of traffic formed of packets belonging to the aG+E (bG+E) service level 
class, for which Wbc+E = Woa (Wbc+E = Wcb) is valid, will subsequently be termed the g 
portion, and correspondingly the portion of the traffic formed of packets, for which 
30 Wao+E = Wga (WbG+E = Wfib), wiU be termed the e portion. 

Ttie sequence indications (SaG+E(i) and SbG+E(j)) of an aG+E class packet i and of a 
bG+E class packet j are calculated as follows: 



wo 2005/039122 



PCT/FI2004/0(M>610 



SaG+E(i) = max {v, S^g+e (i -1) + L(i -l)AVaG+E), 
SbG+eO) = max { V, SbG+E (j -1) + L(j -l)/WbG+E}, 



(1) 



(2) 



10 



15 



20 



25 



in which L(i -1), L(j n-1) are the size of the preceding packet (for example, in bits) and v 
is the sequence indication (virtual time) of the packet being forwarded at the time of 
inspection. The sequence induction is calculated when the packet arrives at the quality- 
level-specific input of the SFQ mechanism, nor it is updated later, even if v changes. The 
next packet to be forwarded is selected as the packet (i or j) with the smaller sequence 
indication. 

A simple test or simulation can be used to demonstrate the following: if the packets of 
service level class aG+E being transmitted during a sp)ecific period of time belong to the 
g portion of aG+E and the service level class bG+E packets being transmitted belong to 
the e portion of bG+E, then the ratio of the bytes (or bits) carried by the aG+E and bG+E 
service level class packets being transmitted during the period in question is Wca: Web- 
The examination gives a better illustration, if all the packets are assumed to be of the 
same size. It is then possible to speak simple of packets, instead of speaking of packets 
representing bits or bytes. By selecting suitable weighting coefficients Wot, Web, Wcb, 
Web, it is possible to define how many packets representing the g or e portions of the 
service level class aG+E are transmitted relative to the packets representing the g or e 
portions of the service level class bG+E. 

One variation of this embodiment is created in such a way that Woa = Wcb, Wsa = Web, 
and Wca » Wea (Wcb » WEbX e.g., Wca = 10 000 x Wea. In fact, this corresponds in 
practice to the packets belonging to the g portion being scheduling using a priority 
principle in such a way that the g portions of the service level classes aG+E and bG+E 
have a mutually equal scheduling priority. This is possible, because the g portions are 
limited in such a way that the transmission band they require is available. 

In this embodiment of the invention which is described, the selection of the packets 
inside the service level class aG+E or bG+E, to which congestion limitation measures 
are applied, is not based on sub-group information, but instead of whether the packet 
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being examined belongs to the g or e portion. The principle is that congestion limitation 
measures are applied first of all to packets representing the e portion. If the increase in 
the length of the queue does not stop by dropping (or marking) packets representing the 
e portion, packets representing the g portion are also begun to be dropped (marked). In 
the WRED method, this is implemented in such a way that the threshold value defined 
for the g portion, which the length of the queue or a derivative of it must exceed before 
packets belonging to the g portion are begun to be dropped (or marked), is greater than 
the corresponding threshold value defined for the e portion. 

Because the g portions of the service level classes aG+E and bG+E are limited in such a 
way that the transmission band required by them is available, the dropping (or marking) 
of purely the packets representing the e portions will already prevent congestion. Thus, 
the length of the queue in congestion situations is determined by the threshold value set 
for the e portion, which can be selected to be low. 

One prefeired variation of this embodiment is achieved in such a way that the measuring 
function is applied only to the G portion and the packets that do not belong to the G 
portion are processed in the e portion. Thus, it is possible to ensure that the greatest 
possible share of the packet that belong to that portion of the traffic, which corresponds 
to the transmission speed promised in the service level agreement (G portion), will be 
processed in the g portion. The application of the measurement to only the G portion can 
be implemented on the basis of sub-group information (p, e.g.. drop precedence). 

References: 



[1] Pawan Goyal, Hairic M. Vin, Haichen Cheng. Start-time Fair Queuing: A 
Scheduling AlgoritJtmfor Integrated Services Packet Switching Networks. Technical 
Report TR-96-02. Department of Computer Sciences, University of Texas Austin. 

[2] Bruce Davie, Yakov Rekhter. MPLS Teclmology and Applications. Academic Press 
2000 CA U.S.A. (www.academic.press.com) 



[3] Sally Floyd, Van Jacobson. Random Early Detection Gateways for Conges^ 



tion 



wo 2005/039122 



PCT/FI2004/000610 



14 

Avoidance. Lawrence Berkeley Laboratory 1993, University of California. 

[4] A description of the WRED algorithm can be found at the Internet address: 
http://www.iumper.net/techncenter/techpapers/200021-01 .html . 

5 

[5] Janne Vaananen. Menetelmdja Laitteisto Siirtoyhteyskapasiteetin Vuorottamiseksi 
Packettikytkent&isten Tietoliikennevoiden Kesken (Method and Equipment for 
Sequencing Tratismission Capacity Between Packet-Switched Data Traffic Flows), 
Finnish patent application No. 20021921, Helsinki Finland 2002. 

10 

[6] Douglas E. Comer. Internetworking with TCP/IP, Third Edition. Prentice Hall 
International Editions, U.S.A. 1995. 

[7] P.F. Chimento. Standard Token Bucket Terminology. 
15 http://qbone.intemet2.edu/bb/Traffic.pdf 2000. 



